Automatic labeling of Japanese prosody using J-ToBI style description

Authors

  • Hiroaki Noguchi
  • Kazuhisa Kiriyama
  • Hiroshi Matsuda
  • Miki Taniguchi
  • Yasuharu Den
  • Yasuhiro Katagiri
Abstract

Speech corpora with prosodic labels are becoming increasingly important, not only for speech synthesis but also for discourse modeling. However, J-ToBI, a widely used labeling system for Japanese prosody, is insufficient for applications such as discourse modeling, and it lacks an accurate method for automatic labeling. In this paper, we propose an automatic labeling method for J-ToBI style description of tonal events in Japanese speech, with the aim of applying it to general-purpose labeling of Japanese prosody. The proposed method takes into account linguistic constraints on tone structure, which improves the accuracy of automatic labeling. We achieve fairly good performance in a preliminary experiment using a read speech corpus.

Similar resources

Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis

Prosody is an important factor in the quality of text-to-speech (TTS) synthesis. Typically, acoustic parameters such as f0 and duration are the only variables related to prosody that are used to determine unit selection. Our study explored adding the explicit use of linguistically and perceptually motivated prosodic categories in unit selection-based TTS. One of our goals was to automate the pro...

Japanese MULTEXT: a Prosodic Corpus

A prosodic corpus of Japanese was developed as a scheduled project by university researchers in Japan. This paper describes the contents of the corpus: speakers, speaking style, recording conditions, and prosodic annotations. The corpus is a Japanese version of the MULTEXT prosodic database of EUROM1. We adopted a J-ToBI prosodic labeling scheme as well as additional labels such as pitch range...

Evaluation of a prosodic labeling system utilizing linguistic information

A prosodic labeling support system has been developed and evaluated. Large-scale prosodic databases have been strongly desired for years; however, the construction of such databases depends on hand labeling because of the diversity of prosody. We aim not at automating the whole labeling process, but at making hand labeling more efficient by providing labelers with appropriate support information. A meth...

Automatic ToBI prediction and alignment to speed manual labeling of prosody

Tagging of corpora for useful linguistic categories can be a time-consuming process, especially with linguistic categories for which annotation standards are relatively new, such as discourse segment boundaries or the intonational events marked in the Tones and Break Indices (ToBI) system for American English. A ToBI prosodic labeling of speech typically takes even experienced labelers from 100...

Automatic Assessment of Non-Native Prosody by Measuring Distances on Prosodic Label Sequences

The aim of this paper is to investigate how automatic prosodic labeling systems contribute to the evaluation of non-native pronunciation. In particular, it examines the efficiency of a group of metrics to evaluate the prosodic competence of non-native speakers, based on the information provided by sequences of labels in the analysis of both native and non-native speech. A group of Sp_ToBI label...

Publication date: 1999